This is a sequence classification model based on the DeBERTa v3 Large architecture, specifically designed to predict whether user prompts need to be grounded through external resources (such as web searches, databases, or RAG pipelines). This model acts as a routing layer in the LLM pipeline, helping to optimize retrieval decisions, latency, and costs.
Natural Language Processing
SafetensorsEnglish